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Abstract. We exploit a bijection between plane recursive trees and 
Stirling permutations; this yields the equivalence of some results previ- 
ously proven separately by different methods for the two types of objects 
as well as some new results. We also prove results on the joint distri- 
bution of the numbers of ascents, descents and plateaux in a random 
Stirling permutation. The proof uses an interesting generalized Polya 
urn. 



1. Introduction 

A plane recursive tree is a rooted plane (= ordered) tree obtained by 
starting with the root and recursively adding leaves to the tree. We la- 
bel the root by and the added vertices by 1,2, ... ; a plane recursive tree 
with n + 1 vertices is thus a labelled rooted plane tree (with labels 0, . . . , n) 
where the labels increase along each branch as we travel from the root. 
Since a new vertex may be joined to an existing vertex i in di + 1 positions, 
where di is the outdegree of i, the total number of positions for vertex n 
is Y17=o (di + 1) = 2n — 1 (and in particular not depending on the present 
shape of the tree), and thus the number of plane recursive trees with n+ 1 
vertices is (2n — 1)!! := 1 • 3 • • • (2n — 1). A random plane recursive tree with 
n + 1 vertices is obtained by randomly (and uniformly) choosing one of these 
(2n — 1)!! trees; equivalently, it is obtained by recursively adding n vertices 
to the root, each time choosing the position uniformly at random among 
the possibilities. Random plane recursive trees were studied by Mahmoud, 
Smythe and Szymahski [12|. They obtained, among other results, an as- 
ymptotic joint normal distribution for the numbers of vertices of outdegrees 
0,1,2; this was later extended to arbitrary outdegrees by Janson [nj. More 
recently, the distribution of the degree of the node with a given label has 
been studied by Kuba and Panholzer [llj]. See also 0] and the survey 14], 
where also other related types of trees are studied. 

The well-known depth first walk of a rooted plane tree starts at the root, 
goes first to the leftmost daughter of the root, explores that branch (recur- 
sively, using the same rules), returns to the root, and continues with the 
next daughter of the root, until there are no more daughters left. Note that 
every edge is passed twice in this walk, once in each direction. We find it 
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convenient to label the edges too in a plane recursive tree, using the labels 
1,2,... in the order the edges are added to the tree. Thus, edge j is the edge 
connecting vertex j to some earlier vertex. We code a plane recursive tree 
by the sequence of the labels of the edges passed by the depth first walk; a 
plane recursive tree with n + 1 vertices is thus coded by a string of length 
2n, where each of the labels 1, . . . , n appears twice. In other words, the code 
is a permutation of the multiset {1, 1, 2, 2, . . . , n, n}. Adding a new vertex 
n + 1 means inserting the pair (n + l)(n + 1) somewhere in the code, at one 
of 2n + 1 possible places (first, last, or in any of the 2n — 1 gaps between of 
consecutive labels) corresponding to the In + 1 places in the tree where a 
new leaf might be added. (We will in the sequel call all these 2n + 1 places 
'gaps', including the places before the first element and after the last.) This 
shows that the code determines the plane recursive tree uniquely. Moreover, 
it follows that the possible codes are exactly the Stirling permutations de- 
fined by Gessel and Stanley [6||: a Stirling permutation is a permutation of 
{1, 1, 2, 2, . . . , n, n} such that for each i < n, the elements occuring between 
the two occurences of i are larger than i. (The name comes from relations 
with the Stirling numbers, see [g].) 

Letting T n be the set of plane recursive trees with n + 1 vertices (and thus 
n edges) and Q n the set of Stirling permutations of length 2n, there is thus 
a simple bijection T n = Q n . We let T n denote a random plane recursive tree 
with n + 1 vertices, i.e., a (uniformly) random element of T n , and let similarly 
Q n denote a random Stirling permutation of length 2n, i.e., a (uniformly) 
random element of Q n ; the bijection T n = Q n thus yields a correspondence 
between the random objects T n and Q n . One of the purposes of this note is 
to show how this correspondence connects some previous results on random 
plane recursive trees and random Stirling permutations. We will also extend 
some of these results. (The correspondence is very simple, but we have not 
seen it utilized in the literature before.) 

Gessel and Stanley 0] studied the number of descents in Stirling permu- 
tations. This was recently continued and extended by Bona using the 
following definitions: If a\a2 ■ ■ ■ am is a Stirling permutation, say that an 
index i = 0, . . . , 2n (or the gap i,i + 1) is an ascent if a» < aj+i, a descent 
if ai > Oj+i, and a plateau if ai = Oj+i, where we set ao = a2 n +i = 0. 
(Thus is always an ascent and 2n a descent.) Let X n , Y n and Z n denote 
the numbers of ascents, descents and plateaux, respectively, in a random 
Stirling permutation Q n . Thus 

X n + Y n + Z n = 2n + 1. (1.1) 

Let N n d denote the number of vertices with outdegree d in the random 
plane recursive tree T n and let L n := N n Q be the number of leaves. It 
is immediately seen that in the correspondence above, leaves in the plane 
recursive tree correspond to plateaux in the Stirling permutation, and thus 
the number of leaves in the random plane recursive tree T n equals the number 
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of plateaux in Q n , i.e., 

L n = Z n . (1-2) 

As said above, L n was studied by Mahmoud, Smythe and Szymariski 
fl3 ] (note that our L n is their L n+ i). They proved, using simple recursion 
relations, 

In + 1 , , 

^L n = — - — , (1.3) 

V^L n = Y~ l \ ~- (1-4) 
9(2n - 1) 9 v ' 

and, using a generalized Polya urn (see Section [2] and Remark 12 .6|) the 
asymptotic normality 

Ln_2n/3 ^AT(0,l/9). (1.5) 



(All unspecified limits in this paper are as n — > oo.) Of course, in (|1.5p . we 
can replace 2n/3 by the exact mean (2n + l)/3 from (|1.3j) . and by (jl-4j) the 

result can also be written (L n — E L n ) / y/V&i L n — ; > N(0, 1). 

Bona [4] proved, among other things, that the random variables X n , Y n 
and Z n have the same distribution (reproved as Corollary 12.21 below) , and 
thus by ([LI) 

2n 4- 1 

EI n = Ey n =EZ n = ^, (1.6) 

and further, by first showing that the probability generating functions have 
only real roots, that these random variables are asymptotically normally 
distributed: 

X n - 2n/3 dY n - 2n/3 d Z n - 2n/3 ^ 



n \ n \ n 



We now see that the equality L n = Z n means that the results (jl.3p and 
(ll.6jl are equivalent, as well as (|1.5p and (II. 7h . Furthermore, p.4p yields 
also an exact formula for the variance of X n , Y n and Z n , which was raised 
as a question in [4J|. 

In Theorem 12. 41 we will extend (II. 7p to joint convergence to a joint normal 
distribution. 

Remark 1.1. Let C n ^ be the number of plane recursive trees with n + 1 
vertices and A; leaves, or, equivalently by the discussion above, the number 
of Stirling permutations of length 2n with k plateaux (or k ascents, or k 
descents). It is easy to see that C n ^ satisfies the recursion 

C n ,k = kC n - lik + (2n - fc)C n _i )fe _i (1.8) 

for all n > 2 and k > 1 (or k G Z), with Ci^ = 8\ k and C nj o = 0, see [6j, 
[1]. These numbers C n ^ are called second-order Eulerian numbers [§, 
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6.2]; in standard notation C n ^ = [8], and thus [12], for n > 1, 




P(L n = A;) = P(X n = fe) = P(Y n = k) = ¥(Z n = k)= (2n ly (1-9) 



Remark 1.2. Another combinatorial interpretation of the same numbers 
Cn,fc is given by Riordan [13J] as the number of trapezoidal words of length 
n with k distinct elements, where a trapezoidal word is a word a\ ■ ■ ■ a n 
with a,i G {1,2, ...,2i — 1}, i.e., an element of [1] x [3] x • • • x [2n — 1]. 
(Again, it is easy to verify the recursion (|1.8p .) Hence L n etc. also give 
the distribution of the number of distinct elements in a random trapezoidal 
word of length n. We do not know any interesting statistics of trapezoidal 
words that correspond to other statistics of plane recursive trees or Stirling 
permutations such as N n( i (d > 1) or the triple (X n ,Y n , Z n ). 

Remark 1.3. For the connections between the second-order Eulerian num- 
bers C n ^k and Stirling numbers, see e.g. 0], 0], [3], 0], 0. 

The correspondence between plane recursive trees and Stirling permuta- 
tions makes it natural to study also other statistics of one of these objects 
and see what they correspond to for the other object, thus giving more equal- 
ities of the type (II. 2ft . (In some cases the results are, however, disappointing 
in that the resulting statistics seem uninteresting.) 

Example 1.4. The number of plateaux in a Stirling permutation is, as 
discussed above, equal to the number of leaves in the corresponding plane 
recursive tree. It seems natural to consider ascents and descents also. It is 
straight-forward to see that an ascent corresponds to a non-root vertex in 
the tree that either has no sister to its left, or else has a higher label than 
the sister that is nearest to it to the left; similarly, a descent corresponds 
to a non-root vertex in the tree that either has no sister to its right, or else 
has a higher label than the sister that is nearest to it to the left; We can 
thus interpret the results on (X n ,Y n , Z n ) as results on the numbers of such 
vertices in a random plane recursive tree, but it remains to show that these 
numbers are interesting. 

Example 1.5. The distance D n k between the two occurences of /c in a ran- 
dom Stirling permutation of length 2n is perhaps a more interesting example. 

(k) (k) 

It equals 2S n — 1 where S n is the size of the subtree rooted at vertex k of 
the corresponding random plane recursive tree. The latter variable was stud- 
ied by Mahmoud, Smythe and Szymahski [12] using a Polya-Eggenberger 
urn; they obtained both exact formulas for the distribution, mean and vari- 
ance, as well as an asymptotic beta distribution: Sn^/n — > [3{\,k) as 
n — > oo for every fixed k > 1. (Recall that we label the root by 0, so the 
labels are shifted from fl2l].) These results immediately tranfer to results 
about D n k, for example, for any fixed k > 1, 



Dnk d 

2n 



as n — > oo. 



(1.10) 
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Example 1.6. The degree of the root of T n was also studied in [12|]. This 
corresponds for a Stirling permutation Q n to the number d such that the 
Stirling permutation is of the form a\ ■ ■ ■ a\02 ■ ■ ■ «2 • • • a>d ' " a d- Again, this 
is not of any obvious great interest. 



Mahmoud, Smythe and Szymahski [12| used a representation with a gen- 
eralized Polya urn to prove the asymptotic normality (jl.5p . (In fact, they 
proved joint asymptotic normality of N n d (d < 2), which was extended in 
[101 to all d, see also @, Example 7.6].) We use a similar urn to study the 
joint distribution of (X n ,Y n , Z n ). 

If we extend a Stirling permutation in Q n by inserting the pair (n + 
l)(n+l) at one of the 2n + 1 possible places (gaps), then the ascent, descent 
or plateau at that gap is destroyed and replaced by the sequence ascent, 
plateaux, descent at the three resulting new gaps. Consequently, the ran- 
dom vector (X n , Y n , Z n ) is described by the following generalized Polya urn 
model: 

Urn I. Consider an urn with balls of three colours, and let (X n ,Y n , Z n ) be 
the number of balls of each colour at time n. At each time step, draw one 
ball at random from the urn, discard it, and add one new ball of each colour. 
Start with (X 1 ,Y 1 ,Z 1 ) = (1,1,1). 

This urn model is completely symmetric in the three colours, and we thus 
immediately see the following. 

Theorem 2.1. For each n > 1, the distribution of (X n ,Y n , Z n ) is exchange- 
able, i.e., invariant under any permutation of the three variables. 

Corollary 2.2 (Bona [4]). X n = Y n = Z n . 

Remark 2.3. We have stated Theorem 12.11 and Corollary 12.21 for a fixed n, 
but the results extend to the processes (X n ) n , (Y n ) n , (Z n ) n . 

It is customary and convenient to formulate generalized Polya urns using 
drawings with replacement. In the case of Urn HI we thus restate the descrip- 
tion above and say instead that we draw a ball and replace it together with 
one ball each of the two other colours. In other words, Urn U is described 
by the replacement matrix 
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(2.1) 



We now easily obtain one of our main results. 
Theorem 2.4. (X n ,Y n , Z n ) are jointly asymptotically normal: 



n- 1/2 (X n - 2n/3,y n - 2n/3,Z n - 2n/3) JV(0,E) 



(2.2) 
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where the asymptotic covariance matrix is given by 




Proof. It is easily seen that the matrix A in (|2.ip has eigenvalues 2, —1, 
— 1. (Note also that A can be regarded as convolution with (0, 1,1) on the 
group Z3, and thus the eigenvalues are given by the Fourier transform of this 
vector.) In particular, the dominant eigenvalue Ai = 2 (the row sum in A), 
and all other eigenvalues lie in the half plane {Re A < Ai/2}. Consequently, 
0, Theorem 3.22] applies and shows the asymptotic normality (|2.2p . 

To identify the asymptotic covariance matrix E, we may use [9j, Lemma 
5.4 and (2.15), or Lemma 5.3] and some straight-forward calculations. We 
may, alternatively, avoid calculations completely by noting that the diagonal 
entries, which must be equal by Corollary |2.2l are 1/9 by ([1.70 : furthermore, 
the off-diagonal entries also are equal by Theorem l2.ll and the row sums in S 
are as a consequence of ([1.10 : hence the off-diagonal entries are —1/18. □ 

An exact formula for the covariances is easily obtained too. 

Theorem 2.5. For every n > 1, 

in 2 - 1) 

Cov(X n , Y n ) = Cov(X n , Z n ) = Cov(Y n , Z n ) = -j——L. (2.4) 

Proof. This can be proved using recursion formulas derived from the urn 
model. It is, however, simpler to use Theorem 12.11 and the already known 
variance (jl.40 . Indeed, by symmetry, the three covariances are equal. Fur- 
ther, by (ILTft . 

Var(X„) + Cov(X n , Y n ) + Cov(X„, Z n ) = Cov{X n ,X n + Y n + Z n ) = 0. 

Since X n = Z n = L n , we thus obtain from p. 40 

1 1 (n 2 — I) 

Cov(X n ,Y n ) = -- Var(X n ) = -- Var(L n ) = - ±—> . □ 

Remark 2.6. If we only are interested in the univariate distribution of the 
variables, we may combine two of the colours into one, and thus instead 
study Urn II: the two-colour urn with replacement vectors (0, 2) and (1, 1). 
This is the urn used by [13] to show the asymptotic normality (jl.50 . 

Remark 2.7. Urn [J is a rather special type of generalized Polya urn, since 
(using the formulation of drawing without replacement), the added balls do 
not depend on the drawn ball. This is perhaps even more striking in the cor- 
responding continuous-time multitype branching process (see (J, @, §V.9], 
[§]), which now can be described as follows: There are individuals of three 
types (colours). Each individual has an exponential lifetime, independent of 
all other individuals. When someone dies, one new individual of each of the 
three colours is born. Since it does not matter which individuals that die, it 
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might be thought that the result would be like taking a large number 3n of 
individuals, n of each colour, and randomly removing n — 1 of them. How- 
ever, this is not correct, not even asymptotically; in fact, a simple calculation 
of the asymptotic variance in this simplified model (where the number of 
remaining individuals of a given colour has a hypergeometric distribution) 
yields (4/27)ra instead of n/9. The reason is that although the individuals 
die independently, they are born together in triplets. Moreover, since the 
individuals in older triplets have larger probabilities of having died at some 
given instance, there is a positive correlation between the deaths (up to a 
given time) for the individuals in the same triplet; since these individuals 
have different colours, this tends to decrease the variances. 

Consequently, we drawn the conclusion (and warning!) that although 
Urn [I] is a very simple type of generalized Polya urn, the fact that the 
replacements do not depend on the drawn colour does not really simplify 
the arguments. (At least, we do not see any simplification.) This also em- 
phasizes that it is usually better to formulate generalized Polya urn models 
using drawing with replacement; in our case we than have the replacement 
matrix A in (|2.ip . which (although very nice in other respects) describes 
replacements that do depend on the drawn colour. 

3. Further comments 

The proof by Bona [4] of the asymptotic normality (jl.7p is completely 
different and is based on first showing that the probability generating func- 
tions have only real roots; thus the random variables can be represented as 
sums of independent Bernoulli variables. (These Bernoulli variables have in 
general irrational means and do not have any combinatorial interpretation.) 
This is a strong property that implies not only asymptotic normality (pro- 
vided the variance tends to infinity, as in this case); it has other desirable 
consequences too, for example it leads to explicit error estimates for the con- 
vergence to the normal distribution as well as to large deviation estimates 
(Chernoff bounds). 

The success of the generating function in this context suggests that it 
might be interesting and profitable to study the trivariate probability gen- 
erating function for (X n ,Y n , Z n ). 

Note that, by Remark 11.14 the univariate probability generating func- 
tion of L n (or Z n ) is (2n — l)!!" 1 times the generating function J2k^n,kX k , 
which has interesting properties studied in [3], 0], 0], 0]. It might 

be hoped that the trivariate generating function too has some interesting 
combinatorial properties. 
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